5 <110> APPLICANT: Wittekind, Michael
7 Weinheimer, Steven
9 Zhang, Yaqun
11 Goldfarb, Valentina
15 <120> TITLE OF INVENTION: Modified Forms of Hepatitis C NS3 Protease for
17 Facilitating Inhibitor Screening and Structural Studies
19 of Protease: Inhibitor Complexes
23 <130> FILE REFERENCE: DB17Sequences
C--> 27 <140> CURRENT APPLICATION NUMBER: US/09/965,594 
C--> 29 <141> CURRENT FILING DATE: 2001-09-27 



33 <150> PRIOR APPLICATION NUMBER: 60/115,271
35 <151> PRIOR FILING DATE: 1999-01-08
39 <160> NUMBER OF SEQ ID NOS: 26
43 <170> SOFTWARE: PatentIn Ver. 2.0
47 <210> SEQ ID NO: 1
49 <211> LENGTH: 182
51 <212> TYPE: PRT
53 <213> ORGANISM: Hepatitis C virus
57 <400> SEQUENCE: 1

59 Met Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly
61 1 5 10 15
65 Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu Gly
67 20 25 30
71 Glu Val Gln Ile Val Ser Thr Ala Ala Gln Thr Phe Leu Ala Thr Cys
73 35 40 45
77 Ile Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr
79 50 55 60
83 Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met Tyr Thr Asn Val Asp
85 65 70 75 80
89 Lys Asp Leu Val Gly Trp Pro Ala Pro Gln Gly Ser Arg Ser Leu Thr
91 85 90 95
95 Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala
97 100 105 110
101 Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu
103 115 120 125
107 Ser Pro Arg Pro Ile Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu
109 130 135 140
113 Leu Cys Pro Ala Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys
115 145 150 155 160
119 Thr Arg Gly Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu Ser Leu
121 165 170 175
125 Glu Thr Thr Met Arg Ser
127 180

133 <210> SEQ ID NO: 2
135 <211> LENGTH: 549
137 <212> TYPE: DNA
139 <213> ORGANISM: Hepatitis C virus
143 <400> SEQUENCE: 2

145 atggctccga tcaccgctta cgctcagcag acccgtggtc tgctgggttg catcatcacc 60 
147 tccctgaccg gtcgtgacaa aaaccaggtt gaaggtgaag ttcagatcgt ttccaccgct 120 
149 gctcagacct tcctggctac ctgcatcaac ggtgtttgct ggaccgttta ccacggtgct 180 
151 ggtacccgta ccatcgcttc cccgaaaggt ccggttatcc agatgtacac caacgttgac 240 
153 aaagacctgg ttggttggcc ggctccgcag ggttcccgtt ccctgacccc gtgcacctgc 300 
155 ggttcctccg acctgtacct ggttacccgt cacgctgacg ttatcccggt tcgtcgtcgt 360 
157 ggtgactccc gtggttccct gctgtccccg cgtccgatct cctacctgaa aggttcctcc 420 
159 ggtggtccgc tgctgtgccc ggctggtcac gctgttggta tcttccgtgc tgctgtttgc 480 
161 acccgtggtg ttgctaaagc tgttgacttc atcccggttg aatccctgga aaccaccatg 540 
163 cgttcctga 549 
167 <210> SEQ ID NO: 3 
169 <211> LENGTH: 195
171 <212> TYPE: PRT 

173 <213> ORGANISM: Hepatitis C virus 
177 <400> SEQUENCE: 3 



179 Met Lys Lys Lys Gly Ser Val Val Ile Val Gly Arg Ile Val Leu Asn
181 1 5 10 15
185 Gly Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr
187 20 25 30
191 Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Ile
193 35 40 45
197 Val Ser Thr Ala Ala Gln Thr Phe Leu Ala Thr Cys Ile Asn Gly Val
199 50 55 60
203 Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr Ile Ala Ser Pro
205 65 70 75 80
209 Lys Gly Pro Val Ile Gln Met Tyr Thr Asn Val Asp Lys Asp Leu Val
211 85 90 95
215 Gly Trp Pro Ala Pro Gln Gly Ser Arg Ser Leu Thr Pro Cys Thr Cys
217 100 105 110
221 Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro
223 115 120 125
227 Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro
229 130 135 140
233 Ile Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ala
235 145 150 155 160
239 Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val
241 165 170 175
245 Ala Lys Ala Val Asp Phe Ile Pro Val Glu Ser Leu Glu Thr Thr Met
247 180 185 190
251 Arg Ser Pro
253 195

259 <210> SEQ ID NO: 4 

261 <211> LENGTH: 588 

263 <212> TYPE: DNA 

265 <213> ORGANISM: Hepatitis C virus 
269 <400> SEQUENCE: 4 

271 atgaaaaaaa aaggttccgt tgttatcgtc ggccgtatag tactgaacgg tgcttacgct 60 
273 cagcagactc gaggtctgct gggttgcatc atcacctccc tgaccggtcg tgacaaaaac 120 
275 caggttgaag gtgaagttca gatcgtttcc accgctgctc agaccttcct ggctacctgc 180
277 atcaacggtg tttgctggac cgtttaccac ggtgctggta cccgtaccat cgcttccccg 240 
279 aaaggtccgg ttatccagat gtacaccaac gttgacaaag acctggttgg ttggccggct 300 
281 ccgcagggtt cccgttccct gaccccgtgc acctgcggtt cctccgacct gtacctggtt 360 
283 acccgtcacg ctgacgttat cccggttcgt cgtcgtggtg actcccgtgg ttccctgctg 420 
285 tccccgcgtc cgatctccta cctgaaaggt tcctccggtg gtccgctgct gtgcccggct 480 
287 ggtcacgctg ttggtatctt ccgtgctgct gtttgcaccc gtggtgttgc taaagctgtt 540 
289 gacttcatcc cggttgaatc cctggaaacc accatgcgtt ccccgtga 588 
293 <210> SEQ ID NO: 5 
295 <211> LENGTH: 15 
297 <212> TYPE: PRT 

299 <213> ORGANISM: Hepatitis C virus 
303 <400> SEQUENCE: 5 

305 Gln Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr
307 1 5 10 15

313 <210> SEQ ID NO: 6 
315 <211> LENGTH: 15 
317 <212> TYPE: PRT 

319 <213> ORGANISM: Hepatitis C virus 
323 <400> SEQUENCE: 6 

325 Gln Gln Thr Arg Gly Glu Glu Gly Cys Gln Glu Thr Ser Gln Thr
327 1 5 10 15 

333 <210> SEQ ID NO: 7
335 <211> LENGTH: 15 
337 <212> TYPE: PRT 

339 <213> ORGANISM: Hepatitis C virus 
343 <400> SEQUENCE: 7 

345 Gln Gln Thr Arg Gly Glu Glu Gly Cys Gln Gln Thr Ser Glu Thr
347 1 5 10 15 

353 <210> SEQ ID NO: 8 
355 <211> LENGTH: 15 
357 <212> TYPE: PRT 

359 <213> ORGANISM: Hepatitis C virus 
363 <400> SEQUENCE: 8 

365 Gln Gln Thr Arg Gly Asn Gln Gly Cys Glu Lys Thr Ser Glu Thr
367 1 5 10 15

373 <210> SEQ ID NO: 9 
375 <211> LENGTH: 15 
377 <212> TYPE: PRT 

379 <213> ORGANISM: Hepatitis C virus 
383 <400> SEQUENCE: 9

385 Gln Gln Thr Arg Gly Glu Gln Gly Cys Gln Lys Thr Ser His Thr
387 1 5 10 15

393 <210> SEQ ID NO: 10 
395 <211> LENGTH: 15 
397 <212> TYPE: PRT 

399 <213> ORGANISM: Hepatitis C virus 
403 <400> SEQUENCE: 10 

405 Gln Gln Thr Arg Gly Glu Gln Gly Cys Asp Glu Thr Ser Glu Thr
407 1 5 10 15

413 <210> SEQ ID NO: 11 
415 <211> LENGTH: 15 
417 <212> TYPE: PRT 

419 <213> ORGANISM: Hepatitis C virus 
423 <400> SEQUENCE: 11 

425 Gln Gln Thr Arg Gly Glu Glu Gly Cys Glu Gln Thr Ser Glu Thr
427 1 5 10 15 

433 <210> SEQ ID NO: 12 
435 <211> LENGTH: 195 
437 <212> TYPE: PRT 

439 <213> ORGANISM: Hepatitis C virus 
443 <400> SEQUENCE: 12 



445 Met Lys Lys Lys Gly Ser Val Val Ile Val Gly Arg Ile Val Leu Asn
447 1 5 10 15
451 Gly Ala Tyr Ala Gln Gln Thr Arg Gly Glu Glu Gly Cys Gln Glu Thr
453 20 25 30
457 Ser Gln Thr Gly Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Ile
459 35 40 45
463 Val Ser Thr Ala Ala Gln Thr Phe Leu Ala Thr Cys Ile Asn Gly Val
465 50 55 60
469 Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr Ile Ala Ser Pro
471 65 70 75 80
475 Lys Gly Pro Val Ile Gln Met Tyr Thr Asn Val Asp Lys Asp Leu Val
477 85 90 95
481 Gly Trp Pro Ala Pro Gln Gly Ser Arg Ser Leu Thr Pro Cys Thr Cys
483 100 105 110
487 Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro
489 115 120 125
493 Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro
495 130 135 140
499 Ile Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ala
501 145 150 155 160
505 Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val
507 165 170 175
511 Ala Lys Ala Val Asp Phe Ile Pro Val Glu Ser Leu Glu Thr Thr Met
513 180 185 190
517 Arg Ser Pro
519 195

525 <210> SEQ ID NO: 13 

527 <211> LENGTH: 588 

529 <212> TYPE: DNA 

531 <213> ORGANISM: Hepatitis C virus 
535 <400> SEQUENCE: 13 

537 atgaaaaaaa aaggatccgt tgttatcgtc ggccgtatag tactgaacgg tgcttacgct 60 
539 cagcagactc gaggtgagga gggttgccaa gaaacctccc agaccggtcg tgacaaaaac 120 
541 caggttgaag gtgaagttca gatcgtttcc accgctgctc agaccttcct ggctacctgc 180 
543 atcaacggtg tttgctggac cgtttaccac ggtgctggta cccgtaccat cgcttccccg 240 
545 aaaggtccgg ttatccagat gtacaccaac gttgacaaag acctggttgg ttggccggct 300 
547 ccgcagggtt cccgttccct gaccccgtgc acctgcggtt cctccgacct gtacctggtt 360
549 acccgtcacg ctgacgttat cccggttcgt cgtcgtggtg actcccgtgg ttccctgctg 420 
551 tccccgcgtc cgatctccta cctgaaaggt tcctccggtg gtccgctgct gtgcccggct 480 
553 ggtcacgctg ttggtatctt ccgtgctgct gtttgcaccc gtggtgttgc taaagctgtt 540 
555 gacttcatcc cggttgaatc cctggaaacc accatgcgtt ccccgtga 588 
559 <210> SEQ ID NO: 14 
561 <211> LENGTH: 197 
563 <212> TYPE: PRT 

565 <213> ORGANISM: Hepatitis C virus 
569 <400> SEQUENCE: 14 



571 Met Lys Lys Lys Gly Ser Val Val Ile Val Gly Arg Ile Asn Leu Ser
573 1 5 10 15
577 Gly Asp Thr Ala Tyr Ala Gln Gln Thr Arg Gly Glu Glu Gly Cys Gln
579 20 25 30
583 Glu Thr Ser Gln Thr Gly Arg Asp Lys Asn Gln Val Glu Gly Glu Val
585 35 40 45
589 Gln Ile Val Ser Thr Ala Ala Gln Thr Phe Leu Ala Thr Cys Ile Asn
591 50 55 60
595 Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr Ile Ala
597 65 70 75 80
601 Ser Pro Lys Gly Pro Val Ile Gln Met Tyr Thr Asn Val Asp Lys Asp
603 85 90 95
607 Leu Val Gly Trp Pro Ala Pro Gln Gly Ser Arg Ser Leu Thr Pro Cys
609 100 105 110
613 Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val
615 115 120 125
619 Ile Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro
621 130 135 140
625 Arg Pro Ile Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys
627 145 150 155 160
631 Pro Ala Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg
633 165 170 175
637 Gly Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu Ser Leu Glu Thr
639 180 185 190
643 Thr Met Arg Ser Pro
645 195
651 <210> SEQ ID NO: 15 
653 <211> LENGTH: 594 
655 <212> TYPE: DNA 

657 <213> ORGANISM: Hepatitis C virus 
661 <400> SEQUENCE: 15

663 atgaaaaaaa aaggatccgt tgttatcgtc ggccgtatca acctgtccgg tgacaccgct 60 
665 tacgctcagc agactcgagg tgaggagggt tgccaagaaa cctcccagac cggtcgtgac 120 
667 aaaaaccagg ttgaaggtga agttcagatc gtttccaccg ctgctcagac cttcctggct 180 
669 acctgcatca acggtgtttg ctggaccgtt taccacggtg ctggtacccg taccatcgct 240 
671 tccccgaaag gtccggttat ccagatgtac accaacgttg acaaagacct ggttggttgg 300 
673 ccggctccgc agggttcccg ttccctgacc ccgtgcacct gcggttcctc cgacctgtac 360 
675 ctggttaccc gtcacgctga cgttatcccg gttcgtcgtc gtggtgactc ccgtggttcc 420 
677 ctgctgtccc cgcgtccgat ctcctacctg aaaggttcct ccggtggtcc gctgctgtgc 480 